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(54) Parallel turbo coder implementation 

(57) In order to achieve a turbo coder block having 
an increased processing speed it is proposed to carry 
out a paralleli7ation of degree n. As a result each paral- 
lelized turbo coder block comprises a storage unit 
(10, .... 17) to store n samples (l(t-1), .... I(t-n)) of an input 
signal l(t) to the parallelized turbo coding block and at 
least one storage unit (QO, Q7) to store n samples 
(Q|(t), Qj(t-(n-1)) of at least one output signal Qj(t) 
(j=1 M) of the parallelized turbo coding block. Fur- 



ther, the parallelized turbo coderblock comprises a bank 

of n delay units (X 1 X N ) and is adapted to a parallel 

processing of n samples of the input signal l(t) such that 

at least two delay units (X 1 X N ) of the bank directly 

receive subsets of the n samples (l(t-1) 5 I (t-n)) of the 
input signal l(t) and an output signal of at least one delay 
unit (Xt, X N ) in the parallelized turbo coder block is 
supplied to at least two delay units in the parallelized 
turbo coder block. 
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Description 

FIELD OF INVENTION 

5 [0001] The present invention relates to a turbo coder implementation, and in particular the present invention relates 
to a parallelization of a turbo coder implementation. 

BACKGROUND ART 

io [0002] The base band part of a transmitter in a telecommunication system usually includes together with other parts 
a general encoder for bits to be transmitted. The encoder adds redundant information to an incoming data stream with 
bit size K. Thereby, the number of bits is increased by a factor of 1/r, depending on the coder rate r. Currently, coder 
rates of 1 12 and 1 /3 are commonly used, although other rates are possible as well. As result for each block of K uncoded 
bits the encoder outputs a block of K/r coded bits. 

is [0003] At a receiving side for the data stream in the telecommunication system the original data stream is recalculated 
from the coded data block in a receiver even in case some bits get corrupted during transmission over, e.g., the air 
interface. 

[0004] Recently, turbo coders have been introduced for the purpose of error control for the transmission of telecom- 
munication data. In general, turbo coding involves applying two or more component codes to different interleaved 
20 versions of the same information sequence prior to transmission. As turbo coding is well known in the art, e.g., Berrou 
et a!., "Near Shannon Limit Error - Correcting Coder and Decoder; Turbo- Codes", IEEE International Communication 
Conference : pp. 1064-1070, May 1993 and Sklar, "A Primer on Turbo Code Concepts", IEEE Communications Mag- 
azine, pp. 94-102, December 1997, no further details will be given here and the stated references are incorporated 
herein by reference. 

25 [0005] An example for a turbo coder is shown in Fig. 8. As shown in Fig. 8, the turbo encoder consists of two turbo 
coder blocks TCB1 , TCB2 being identical in structure. The difference for the two turbo coder blocks is that one receives 
the bits of the input block in an unchanged order, whereas the other receives the input bits in an interleaved order. For 
each input bit., e.g., three output bits are generated at the output 0, 1, and 2, respectively. 

[0006] As shown in Fig. 8, the turbo coder block TCB1 comprises a first XOR gate 100 at its input and a second 
30 XOR gate 102 at its output. In between there are arranged three delay units 1 04 to 108 for the delay of the respective 
input bits. Similarly, the turbo coder block TCB2 comprises a third XOR gate 110 at its input and a fourth XOR gate 
112 at its output. In between, there are provided three delay units 114 to 118 that delay the respective input bits. 
[0007] As also shown in Fig. 8, the input signal to the turbo coder is supplied directly to the turbo coder block 1 while 
it is supplied via an interleaver 120 to the second turbo coder block TCB2. For the output 0 the input signal is also 
35 directly forwarded without any modification. 

[0008] Operatively, parameters for the turbo coder shown in Fig. 8 are the number of delay units in each turbo coder 
block and further the supply of input signals to the different XOR gates 100, 102, 110, and 112. 
[0009] A straightforward implementation of the turbo coder shown in Fig. 8 relies on the use of delay units as well 
as input and output registers (not shown in Fig. 8) such that turbo coding is done in a bitwise manner. Here, all delay 
to units work with the same system clock so that the output of a delay unit represents the input thereof at the previous 
clock cycle. 

[0010] While the straightforward implementation of the turbo coder using the delay units in a serial manner does not 
require many registers and XOR gates the main disadvantage is that turbo coding is executed in a serial way. This 
means that only one bit is coded per cycle of the system clock. In conclusion, for cases where high bit rates are 
^5 necessary, the system clock frequency is to be increased to very high values. 

[0011] If, e.g., 1200 channels are to be encoded each being related to a voice channel with 100 bits in a time period 
of 1 msec, the necessary system clock frequency is 120 Mhz. 

[0012] Here, it would be very difficult to implement a related turbo coder, e.g., using ASIC or FPGA technology. 
[0013] While one solution would be to implement a dedicated turbo coder for each channel, this would require a 
50 complicated handling of input and output bit streams since different channels still must be coded in parallel. This would 
lead to a very complex control logic to supply the right input at the correct time to the right turbo coder. Further, the 
outputs of the different turbo coders also would have to be handled in the same complicated way. 

SUMMARY OF INVENTION 

55 

[0014] In view of the above, the object of the invention is to increase the processing speed of turbo coder blocks. 
[0015] According to the present invention this object is achieved through a parallel realization of turbo coders. 
[0016] Heretofore, the structure of a turbo coder block is initially described using a general formalized description 
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which then forms the basis for a parallelization of the turbo coder block. 

[0017] In particular, input samples to the turbo coder block are interpreted as elements of a parallel input vector of 
dimension n, where n is the degree of parallelization. 

[0018] The general formalized description of the turbo coder block is then used to derive a mapping of this parallel 

5 input vector into at least one parallel output vector. 

[0019] In more detail, an internal state variable substitution process is applied to each internal state variable of the 
general formalized description wherein the representation of the internal state variables is scanned for maximum time 
index elements which are then substituted through carrying out backward time index transitions using previously de- 
termined time index substituted representations of the internal state variables. These substitution steps are repeated 

10 until the representation of each internal state variable is only dependent on input vector elements and values for internal 
state variables of the turbo coder being delayed according to the degree of parallelization. 

[0020] Further the substitution process is carried out as well for each element of each parallel output vector. Again 
the representation of each vector element in each output vector is scanned for internal state variables having maximum 
time index and then a backward time index transition in the representation of the vector element is determined recur- 
15 sively until the representation of the vector element is only dependent on input vector elements and values for internal 
state variables being delayed according to the degree of parallelization. 

[0021] Therefore, according to the present invention it is proposed to parallelize turbo coder blocks so that only one 
parallelized turbo coder block is necessary instead of many serial turbo coders to achieve an increased processing 
speed. This leads to the decisive advantage that no complicated input and output control for a plurality of turbo coder 
20 blocks is necessary. 

[0022] E.g., for a foui bit parallel turbo coder block the resulting system clock frequency in the example mentioned 
above lies in the range of 30 MHz and therefore may be easily realized using FPGA or ASIC technology. Therefore, 
the parallelized turbo coder blocks and the turbo coder derived therefrom achieve a speed-up over the serial turbo 
coder according to the degree of parallelization so that pre-defined specification may be met using FPGA or ASIC 
25 technology-based implementations. Thus : the parallelized turbo coder block may form the basis for complicated tele- 
communication systems having low processing delay in the base band part without complicated handling of different 
channels at the same time. 

[0023] Further, the parallelized turbo coder block and related turbo coder require only a minimum additional surplus 
in logic gates and registers when being compared to the serial turbo coder. 
30 [0024] Still further, when processing blocks in front and subsequent to the turbo coder also support a parallel process- 
ing manner, the complete encoder block in total requires less logic and registers than the encoder block with a serial 
turbo coder and some additional converters (parallel to serial and vice versa). 

[0025] According to another preferred embodiment of the present invention there is also provided a computer pro- 
gram product directly loadable into an internal memory of a computer comprising software code portions for performing 
35 the steps according to the inventive parallelization method when being run on a computer. Preferably, the software 
code portions are of the VHDL type. 

[0026] Therefore, the present invention allows to achieve a basis for a fast development and modification of parallel 
turbo coder design with decreased number of development cycles and increased flexibility with respect to the mapping 
of the design to different hardware technologies, e.g., ASIC or FPGA : respectively. 



BRIEF DESCRIPTION OF DRAWINGS 

[0027] In the following, preferred embodiments of the present invention will be described with reference to the en- 
closed drawings in which: 



40 



45 



Fig. 1 



shows a circuit diagram of a turbo coder block having one output; 



Fig. 2 



shows a circuit diagram of a turbo coder block having a plurality of outputs; 



so 



Fig. 3 



shows the mapping of a time sequential stream of data inputs into a parallel input vector and the mapping 
of a parallel output vector into a time sequential stream of data outputs; 



Fig. 4 



shows a circuit diagram of a serial implementation of a specific turbo coder block; 



55 Fig. 5 



shows a circuit diagram for a two bit parallel implementation of the turbo coder block shown in Fig. 4; 



Fig. 6 



shows a circuit diagram for a four bit parallel implementation of the turbo coder block shown in Fig. 4; 
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Fig. 7 A shows a circuit diagram of the input part of an eight bit parallel implementation of the turbo coder block 
shown in Fig. 4; 

Fig. 7B shows a circuit diagram for an output part of the parallel implementation of the turbo coder block shown in 
5 Fig. 4; and 

Fig. 8 shows a schematic diagram of a turbo coder comprising an interleaver as well as turbo coding blocks TCB1 
and TCB2. 

io DESCRIPTION OF PREFERRED EMBODIMENTS 

[0028] In the following, preferred embodiments of the present invention will be described with reference to Figs. 1 to 7. 
[0029] In particular, initially a formal description of a turbo coder block as shown in Fig. 1 and 2 will be given as basis 
for the description of the parallelization method according to the present invention. 
75 [0030] Then, it will be described with reference to Fig. 3 how a time sequential stream of data inputs may be mapped 
into a parallel input vector to be handled in the parallelized turbo coder block and how the parallel output vector of this 
parallelized turbo coder block may again be mapped into a time sequential stream of data outputs. 
[0031] Subsequent hereto different examples for a parallel implementation of turbo coder blocks using the paralleli- 
zation method according to the present invention will be given with reference to Fig. 4 to 7. 

20 

1. Turbo Coder Block with a Serial Implementation, General Description: 

[0032] Fig. 1 shows a circuit diagram of a turbo coder block using a serial implementation. 

[0033] As shown in Fig. 1 , the turbo coder block comprises N delay units ; X v X 2 , ... X N such as flip flops, e.g.. The 
25 output signals of the delay units X-,, X 2 X N will be referred to as x-j(t), x 2 (t)...x N (t), respectively, where t denotes the 
integer time index. At the input of the first delay unit X-,, there is provided an input XOR gate 12 and at the output of 
the last delay unit X n , there is provided an output XOR gate 14. 

[0034] As also shown in Fig. 1, an input signal l(t) supplied via an input delay unit 16 and the output signal of the 
output XOR gate 14 is forwarded via an output delay unit 16. 
30 [0035] As also shown in Fig. 1, the output signals x 1 (t), x 2 (t), .... x N (t) can be supplied to the input XOR gate 12 via 
connections a-,, a^, .... cr. N . Also, the input signals to the delay units can be supplied to the output XOR gate 14 via 
connections |i 0 , .... |5 no . Further, the output of the delay unit Xm is supplied to the output XOR gate 14 via a con- 
nection [i N . 

[0036] By formally assigning a value of 0 or 1 to each of the connections cx lt 02,..., u N and further to each of the 

35 connections (3 0t |J 1 |3 N it is possible to describe any serial turbo coder block implementation with N delay units 

according to: 
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x 1 (t) = I(t-l) © ai-Xitt-l) © a 2 x 2 (t-l) © . © a N x N (t-l) 

ai e {0, 1} 



x 2 (t) = x x (t-1) 



(l.Xi) 



(l.x 2 ) 



x N (t) = x N _! (t-1) 



Q(t) = Po-[I(t-l) © a 1 -x 1 (t-l) 

© a 2 x 2 (t-1) 

© ... © a N oc N (t-l) ] © 

(3 r x 1 (t-l) © P 2 -x 2 (t-l) © ... © pN'X N (t-l) 

= Po-I (t-1) © 

Po-oti-xi (t-1) ©Pi-xxU-l) © 



(1 .x N ) 
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5 



0 o a N x N (t-l) e (3 N x N (t-l) = 



TO 



= po-Kt-1) © 



75 



20 



x N (t-l)-[(3 0 aN © PnI 



Pi e {0, 1} 



(l.Q) 



25 



[0037] As shown in Fig. 2, the general description given above for a turbo coder block having a single output may 
be generalized to a turbo coder block using a serial implementation and having a plurality of outputs. 
[0038] As Fig. 1 also Fig. 2 shows delay units X v X 2 , . .. X N of which the output signals are again referred to as x 1 
(t), x 2 (t), x N (t). The output signals x^t), x 2 (t), x N (t) of each delay unit X v X 2 , ... X N are supplied to the input XOR 
30 gate 1 2 via connections cl v a 2 ,..., rx N already shown in Fig. 1 Also, the input data stream is supplied to the input XOR 
gate 1 2 via the input delay unit 16. 

[0039] The serial turbo coder block shown in Fig. 2 differs over the previously discussed turbo coder block in that a 
plurality of outputs Q^t) Q M (t) is provided for. As shown in Fig. 2, for each output there is provided a related output 
XOR gate 14-1, ... , 14-M. At the output of each such output XOR gate 14-1, 14-M there is connected a related 

35 output delay unit 18-1, .... 18-M. 

[0040] As also shown in Fig. 2, each input to the delay units X 1} X 2 , .... X N can be supplied to the first output XOR 
gate 14-1 via connections |i 1 0 , (i K1> p 1 2 . P1.N-1. further to the second output XOR gate 14-2 via connections (3 2 Q; (i 21} 
P2.2. P2,n-i. etc - Further, the output of the last delay unit X N is supplied via a connection (J n N to the first output XOR 
gate 14-1, via a connection [3 2 N to the second output XOR gate 14-2, etc.. Finally., the output signal of each output 

40 XOR gate is delayed to derive the turbo coder block output signals Q^t), Q M (t). 

[0041] The formal description of the turbo coder shown in Fig. 2 differs over the previously discussed in that for each 
output Q^t), ... Q M (t) a related output representation is generalized to: 
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Qj (t) = pjo-l (t-D © 

Xl (t-l)-[pjO-«i © Pjil © 



10 



x N (t-l)-[p jo a N © p jN ] 



75 



Pij s {0, 1} 
j e [1, . . . , 



M] 



(l.Q) 
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[0042] The above formal representations of a turbo coder block in serial implementation having either one or a plu- 
rality of outputs will serve as a basis of the description of the parallelization method according to the present invention. 
[0043] However, before the parallelization method according to the present invention will be described in the follow- 
25 jng, initially a mapping of a serial data input stream into a parallel input vector and also the mapping of a parallel output 
vector back into a serial data output stream will be described with reference to Fig 3. 

2. Turbo Coder Block Using a Parallel Implementation of Degree n: 

30 [0044] On the upper left side, Fig. 3 shows a time sequence according to a serial data input stream. Similarly, Fig. 
3 shows in the upper right part a time sequence according to a serial output stream. 

[0045] As shown in Fig. 3 : a parallel input vector 10, 11 , ln-1 may be derived from a series of data input values l(t-1), 
l(t-2), .... I(t-n) according to: 
I(t-1) = l0(p-1) 
35 |(t-2) = H(p-1) 



l(t-n) = In-I(p-I) 

40 [0046] Assuming that the parallel input vector shown in Fig. 3 derived from the serial data input stream is processed 
in a parallel turbo coder block to be decribed in the following, one arrives at a parallel output vector QjO, .... Qjn-1 . 
[0047] As shown in Fig. 3, this parallel output vector may then be mapped back to a serial data output stream Qj 
(t), .... Qj(t-(n-1)) according to: 



45 



Qj (t) 
Qj (t- 



= QjO(p) 
= QjKp) 



50 
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2.1 General Outline of Parallelization 

[0048] According to the present invention there are used different time scales in the serial domain and the parallel 
domain. In other words, a set of n serial data inputs in the serial time domain corresponds to a parallel input vector 

5 that is processed in parallel according to one clock cycle in the parallel time domain p. 

[0049] In conclusion, the resulting parallel system clock is n times slower than the serial system clock assuming that 
the same amount of data is to be processed. Therefore, the serial time expression (t-1 ) may be replaced by the parallel 
time expression (p-1) and a sequence of n serial time inputs is re-written as a single parallel input vector, one parallel 
clock cycle earlier (p-1). Similarly, the series of outputs is re-written as parallel output vector for the current parallel 

10 cycle (p) . 

[0050] Using the general description of a turbo coder block according to Figs. 1 and 2 and further the mapping of 
the serial data input stream into a parallel input vector and the reverse mapping of the parallel output vector into a 
serial output data stream the approach according to the present invention relies on the supply of a parallel input vector 
having n serial data input items to a parallelized turbo coding block and the calculation of M output vectors each 

*s comprising n output data items therein. 

[0051] The n elements of the output vector(s) depend only on values of the internal delay units X n X N at n system 

clocks of the serial turbo coder block implementation earlier - which corresponds to only one cycle in the parallelized 
turbo coder block - and further on all input data items during these n cycles summarized in the parallel input vector. 
[0052] As will be shown in the following, the approach underlying the present invention relies on the determination 

20 of equations represenling Ihe output elements of the parallel output vector and the output values of the internal delay 
units of the parallel turbo coder block for the next parallel cycle in dependence of output values of the internal delay 
units of the parallel turbo coder block and the parallel input vector for the previous parallel cycle. 
[0053] According to the serial representation (see eq. (1 .Q)) each output value of a serially implemented turbo coding 
block at time t is calculated from output values of the delay unit X 1 , X N at time t-1 and the input data item at time t-1 . 

25 [0054] By replacing all output values of the internal delay units with values at one system clock cycle for the serial 
implementation earlier and through repeating these replacement steps in a recursive manner until only output values 
at n serial system clock cycles earlier are left there may be derived equations for the parallel calculation of the output 
results over n serial cycles or equivalently one parallel cycle. 

[0055] In other words, these equations form the basis of the parallelized turbo coder blocks as n serial clock cycles 
30 represent one parallel clock cycle. 

2.2 Backward Time Index Substitution for First Internal State: 

[0056] In preparing the recursive substitution process mentioned above one may use the representation for the output 
35 of the first delay unit X, (of. eq. (1 .x^) 

x 1 (t) = I(t-l) 0 arxitt-l) © 

40 a 2 -x 2 (t-1) © ... © 

a N -x N (t-l) 

which is also referred to as first internal state in the following to express this internal state for the serial clock cycles t- 
1, .... t-(n-l) according to: 
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I (t-2) © otl X! (t-2) © 
a 2 x 2 (t-2) © 
a N x N (t-2) 



. © 



<2. Xl .l) 



10 



IS 

X!(t-(n-l)) = I(t-n) ©ai-xi(t-n) © 

a 2 -x 2 (t-n) © ... © 
20 a^ x^ ( t-n) 

(2.xi-n-l) 

25 2.3 Backward time index substitution for further internal states (i=2, ...,N): 

[0057] Further, the representation for the outputs of the additional delay units X 2 , X N (see eq. (1 x 2 ) (1 x N )) 
X|(t) -x M (t-1) 

which are also referred to as further internal states in the following may be used to express these internal states for 
30 the serial clock cycles t-1. t-(n-1) according to: 

Xj_ (t-1) = Xi_! (t-2) 

3S <2.Xi.l) 



40 

Xi (t- (n-1) ) = xi-i (t-n) 
45 (2.Xi.n-l) 

2.4 Backward time Index substitution for output vector elements: 
so [0058] Still further, the representation for the at least one output of the turbo coder block (of. eq. (1 Q)) 
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Qj (t) = PjQ-I (t-D © 

X! (t-D tpjo-ai © PjlJ © 



x N (t-l)-[p j0 -a N © P jN ] 



15 

may be used to express this output tor the serial clock cycles t-1 , .... 1-(n-1 ) according to: 



20 



25 



Qj (t-1) = Pjo l (t-2) © 

X! (t-2)-[pj 0 -cti © Pji) © 



(2.Q.0) 



x N (t-2) (p j0 a N © p jN ] 
30 (2.Q.1) 



35 



40 Qj (t- (n-1) ) - 

PjO-Kt-n) © 

Xl (t-nj-tpjoa! © pji] © 



45 



50 x N (t-n) [Pj 0 a N © p jN ] 

(2 .Q.n-1) 
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2.5 Parallelization through recursive backward time index transition steps: 
2.5.1 Object 

5 [0059] As already outlined above the object underlying the parallelization process according to the present invention 

is to express the elements of each parallel output vector - one per output 1 M shown in Fig. 2 - as a function of a 

series of n serial data input items and the internal states one parallel cycle - equivalent to n serial cycles - before the 
current parallel cycle: 

w 

Qj(t-i) =f[I(t-l), I(t-n); x x (t-n) , x N (t-n)] 

for i e [0, . . . , n-1 ] 
15 j E [1, . . M] 

o 

Qj i (p) = f[lO(p-l), In-l(p-l); x^p-l), x N (p-l)]. 

20 

(2.Qi) 

25 2.5.2 Recursive time index transition steps 

[0060] Firstly, to achieve this goal all internal states are parallelized. For all internal states x,(t), x N (t) initially a 
maximum time index to be used in the representation of this internal state is set to t-1 according to a serial description 
of operation 

30 [0061] Then the representations of all internal states are considered and each is recursively scanned to substitute 
internal state variables dependent on the current maximum time index by a formula expressing the same internal state 
in dependence of a time index preceeding this current maximum time index using one of equations (2) listed above. 
[0062] Therefore the representation of the considered internal state is modified in a way that a transition is made 
from the current maximum time index to the time index preceeding the current maximum time index. This transition 

35 will be referred to as backward time index transition in the following. After the backward time index transition is achieved 
for all internal state variables depending on the current maximum time index the current maximum time index is de- 
creased by a value of 1. 

[0063] This recursive repetition of this process continues until the maximum time index reaches a value of t-n. The 
reason for this is that at this point the considered interna! state at cycle t (p) is described in dependency only of internal 
40 states at time t-n (p-1 ) and serial input data items I (t-1 ), .... I(t-n). 

[0064] Secondly, all elements in each of the output vectors are parallelized by carrying out again backward time 
index transitions in a recursive manner as described above for the internal states. 

[0065] Further, it should be mentioned that at intermediate stages between the recursive time index transition steps, 
at modification of the current maximum time index, the respective representations of internal states and elements of 
45 output vectors may be simplified using the relation (a © a © b - b). In other words double terms (a 0 a) may be 
cancelled in the respective representations. 

[0066] The recursive parallelization of the serial turbo coder block in the most general form as shown in Fig. 2 may 
be summarized as follows: 



55 
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procedure irecurslve^parallellza t±on (n) 

int i, j, k; 
{ 



/* parallelization of internal states */ 



for( k=l; k<=N; k++ ) 
{ 

/* consider internal state x^ft) */ 

while ( tmax > t-n ) 
{ 

scan representation of internal state x^ft) 
for internal states with maximum time index 
tmax' 

for ( all internal states with maximum time 
index ) 
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{ 

execute backward time index transition 
from t max to t max -l through state variable 
substitution using eq. (2) above; 

} 

cancel double terms in representation of 
internal state x^U); 
tmax = t max -l; 



/* parallelization of output variables */ 

for( j=l; j<=M; j++ ) 
{ 

f or ( i=0; i<=n-2; i++ ) 
{ 

/* consider output vector element Qj(t-i) */ 
tmax = t-i-1; 
while ( tmax > t-n ) 

{ 

scan representation of Qj (t-i) for 
internal states with maximum time index; 



f or ( all internal states with maximum time 
index ) 
( 

execute backward time index 
transition from t max to t max -l 
so through state variable substitution 

using eq. (2) above; 

} 

55 
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cancel double terms in representation of 

Qj (t-i) ; 

t max = tmax -1 '' 

} 

} 

} 

} /* end procedure */ 

is [0067] To illustrate the application of the parallelization approach generally described above, in the following, this 
parallelization process will be explained in more detail with reference to specitic examples. 

[0068] As will be shown in the following each parallelized turbo coder block comprises a storage unit, e.g., a first 
group of flip flops 10, 17 to store n samples l(t-1 ), .... I(t-n) of the input signal l(t) for further supply to the parallelized 
turbo coding block and at least one storage unit, e.g., a second group of flip flops QO, Q7 to store n samples Qj 

20 (i), Q|(i-(n-1 )) of the at leasl one oulpul signal Qj(L) : j-t , M, of the parallelized lurbo coding block. 

[0069] Further, the parallelized turbo coder block comprises a bank of n delay units , .... X N and is adapted to a 
parallel processing of n samples of the input signal l(t) such that at least two delay units X-j, .... X N of the bank directly 
receive subsets of the n samples l(M), .... I(t-n) of the input signal l(t) and an output signal of at least one delay unit 
X v . . . , X N in the parallelized turbo coder block is supplied to at least two delay units in the parallelized turbo coder block. 

2S [0070] Fig. 4 shows a circuit diagram of a specific serial turbo coder block as it may be used in the turbo coder shown 
in Fig. 8. 

[0071] Here, it should be noted that the provision of an input delay unit 16 and an output delay unit 18 per se is not 
a prerequisite in case of a serial implementation but becomes necessary in case of parallization to collect the elements 
of the input vectors and output vector as shown in Fig. 3. 
30 [0072] The serial turbo coder block shown in Fig. 4 may be shortly described by N=3, M=1 , a - [a A , a 2 , a 0 ] = [0, 1 , 
1]. P - [Pi. P 2 > P3. P4] = H . 1.0. 1] using the formal description according to eq. (1) introduced above. In the following 
it will be explained how this serial turbo coder block may be parallalized into a n= 2 ; 4. and 8 bit parallel turbo coder block. 

3.1 Example: 2 bit parallel turbo coder for N=3, M=1, a =[0, 1, 1], p = [1, 1, 0, 1] 

35 

[0073] Starting from equation (1) we obtain 

Xl (t) = l(t-1) e x 2 (M) ©x 3 (t-1) 
x 2 (t) = x 1 (t-1) 

40 X 3 (t) = X 2 (M) 

Q(t) = l(M)©x 1 (M)©x 2 (t-1) 

as description for the turbo coder block shown in Fig 4. The following backward time index transitions over two serial 

time cycles are necessary: 
45 x l (t) = l(t-1)©x 2 (t-1)©x 3 (t-1) 

<rr> Xl (t) = l(M) ©x^t-2) ©x 2 (t-2) 

which is in view of the parallel circuit: 

=> Xl (p) = I0(p-1 ) e Xl (p-1 ) e x 2 (p-1 ) 
x 2 (t) = x 1 (t-1) 
so <=> x 2 (t) = |(t-2) © x 2 (t-2) © x 3 (t-2) 

which is in view of the parallel circuit: 

=> x 2 (p) = 11 (p-1 ) CD x 2 (p~1 ) CD x 3 (p-1 ) 
x 3 (t) = x 2 (t-1) 

<=>x 3 (t) = x 1 (t-2) 
55 which is in view of the parallel circuit: 

=:> x 3 (p) -x^p-1) 

Q(t) = l(t-1)e Xl (t-1)® x 2 (t-1) 

<=> Q(t) = l(M ) © [l(t-2) (35 x 2 (t-2) © x 3 (t-2)] © x,(t-2) 
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which is in view of the parallel circuit: 

=> O0(p) = I0(p-1) © I1(p-1) © x^p-1) © x 2 (p-1)fl)x 3 (p-1) 

Q(t-1 ) = l(t-2) © x, (t-2) © x 2 (t-2) 
which is in view of the parallel circuit: 
5 Q1 (p) = II (p i ) © x^p-1 ) © x 2 (p-1 ) 

[0074] Fig. 5 shows the result of this parallelization process, i.e. a circuit diagram for a two bit parallel implementation 
of the turbo coding block shown in Fig. 4. 

3.2 Example: 4 bit parallel turbo coder for N=3, M=1, a = [0, 1, 1], [3 = [1, 1, 0, 1] 

10 

[0075] Starting from equation (1 ) we obtain 
Xl (t) = l(t-1)ex 2 (t-1)0x 3 (t-1) 
x 2 (t) = x 1 (t-1) 
x 3 (t) = x 2 (t-1) 

'5 Q(t) = l(M)©X 1 (t-1)©X 2 (t-1) 

as description tor the turbo coder block shown in Fig. 4. The following backward time index transitions over four serial 
time cycles are necessary: 

x 1 (t) = i(t-i)ex 2 (t-i)©x 3 (M) 

<=> x^t) = l(M) © x^t-2) ©x 2 (t-2) 
20 <=> Xl (l) = 1(1-1) © l(l-3) © x 2 (l-3) © x 3 (l-3) © x^t-3) 
<=z> Xl (t) = l(t-1) © l(t-3) © x^t-4) ©x 2 (t-4) © 

l(t-4) © x 2 (t-4) © x 3 (t-4) 
which is in view of the parallel circuit: 
=> x^p) = I0(p-1 ) © I2(p-1 ) © I3(p-1) © x^p-1 ) © x 3 (p-1 ) 

25 X 2 (t) = X 1 (t-1) 

<=> x 2 (t) = l(t-2) © x 2 (t-2) © x 3 (t-2) 
<=> x 2 (t) = I (t-2) © x,(t-3) © x 2 (t-3) 
<-> x 2 (t) =r |(t-2) © [l(t-4) © X 2 (t-4) © x 3 (t-4)] © x^t-4) 
which is in view of the parallel circuit: 
30 => x 2 (p) = 11 (p-1 ) © I3(p-1 ) © x n (p-1 ) © x 2 (p-1 ) © x 3 (p-1 ) 
x 3 (t) = x 2 (t-l) 
<=> x 3 (t) = x 1 (t-2) 

<=> x 3 (t) = [l(t-3) © x 2 (t-3) © x 3 (t-3)] 

<=> x 3 (t) = l(t-3) © x^t-4) © x 2 (t-4) 
35 which is in view of the parallel circuit: 

=> x 3 (p) = I2(p-1 ) © Xl (p-1 ) © x 2 (p-1 ) 
Q(t) = l(t-1)©x 1 (t-1)©x 2 (t-1) 

<=> Q(t) = l(t-1 ) © [l(t-2) © x 2 (t-2) © x 3 (t-2)] © (t-2) 

<=> Q(t) = l(t-1) © l(t-2) © x^t-3) ©x 2 (t-3) © 
40 [l(t-3)©x 2 (t-3)©x 3 (t-3)] 

<=> Q(t) = l(M) © l(t-2) © l(t-3) © 

[l(t-4) © x 2 (t-4) © x 3 (t-4)] © x 2 (t-4) 

which is in view of the parallel circuit: 

=> Q0(p) = I0(p-1) © 11 (p-1 ) © I2(p-1 ) © I3(p-1 ) © x 3 (p-1 ) 
45 Q(t-i ) = |(t-2) © x, (t-2) © x 2 (t-2) 

<=> Q(M ) = l(t-2) © [1(1-3) © x 2 (t-3) © x 3 (t-3)] © 
Xl (t-3) 

<=> Q(M ) = I (t-2) © l(l-3) © x n (1-4) © x 2 (l-4) © 
[l(t-4)©x 2 (t-4)0x 3 (t-4)| 
50 which is in view of the parallel circuit: 

=> Q1(p) = 11 (p-1) © I2(p-1) © I3(p-1) ©x^p-1) ©x 3 (p-1) 

Q(t-2) = l(t-3) © x^t-3) © x 2 (t-3) 
<=> Q(t-2) = l(t-3) © [l(t-4) © x 2 (t-4) © x 3 (t-4)] © 
x,(t-4) 

55 which is in view of the parallel circuit: 

=> Q2(p) = I2(p-1) © I3(p-1) © x,(p-1) © x 2 (p-1) © x 3 (p-1) 

Q(t-3) = l(t-4) © x^t-4) © x 2 (t-4) 
which is in view of the parallel circuit: 
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Q3(p) = I3(p-1) ©x^p-1) ©x 2 (p-1) 
[0076] Fig. 6 shows the result of this parallelization process, i.e. a circuit diagram for a four bit parallel implementation 
of the turbo coding block shown in Fig. 4. 

3.3 Example: 8 bit parallel turbo coder for N=3, M=1, a = [0, 1, 1], [5 = [1,1, 0, 1] 

[0077] Starting from equation (1 ) we obtain 
Xl (t) = l(t-1)©x 2 (t-1)©x 3 (t-1) 
x 2 (t) = x 1 (t-1) 
x 3 (t) = x 2 (t-l) 

Q(t) = l(t-i)e Xl (t-i)@x 2 (t-i) 

as description for the turbo coder block shown in Fig. 4. The following backward time index transitions over eight serial 
time cycles are necessary: 

Xl (t) = l(M)0x 2 (t-1)0x 3 (t-1) 
<=> x-, (t) = I (t-1 ) e Xt (t-2) © x 2 (t-2) 
<=> Xl (t) = I (t-1 ) © [l(t-3) © x 2 (t-3) © x 3 (t-3)j © Xl (t-3) 
<=> Xl (t) = I (t-1) © I (t-3) ©X-,(t-4) ©x 2 (t-4) © 

[l(t-4)©X 2 (t-4)©X 3 (t-4)J 
<=> Xl (t) = l(t-1) © I (t-3) © l(t-4) © 

[l(t-5) © x 2 (t-5) © x 3 (t-5)] © x 2 (t-5) 
<=> Xl (t) = I (t-1) © I (t-3) © l(t-4) © l(t-5) ©x 2 (t-6) 
<=> Xl (t) = I (t-1) © I (t-3) © l(t-4) © l(t-5) © x^t-7) 
<_> Xl (t) =• I (t-1) © l(t-3) © l(t-4) © l(t-5) © 

[l(t-8)©x 2 (t-8) (Dx 3 (t-8)] 
which is in view of the parallel circuit: 
=> x,(p) = I0(p-1) © I2(p-1) © I3(p-1) © I4(p-1) © 

I7(p-i)©x 2 (p-i)ex 3 (p-i) 

x 2 (t) = x l (t-1) 
<=> x 2 (t) = [I (t-2) © x 2 (t-2) © x 3 (t-2)] 
<=> x 2 (t) = I (t-2) © x^t-3) © x 2 (t-3) 
<=> x 2 (t) = I (t-2) © [l(t-4) © x 2 (t-4) © x 3 (t-4)] © 
Mt-4) 

<=> x 2 (t) = I (t-2) © l(t-4) © x^t-5) ©x 2 (t-5) © 

P(t-5)©x 2 (t-5)©x 3 (t-5)l 
<=> x 2 (t) = I (t-2) © l(t-4) © l(t-S) © 

[I (t-6) © x 2 (t-6) © x 3 (t-6)] © x 2 (t-6) 
<=> x 2 (t) = l(t-2) © lft-4) © l(t-5) © I (t-6) © x 2 (t-7) 
<=> x 2 (t) = I (t-2) © l(t-4) © l(t-5) © l(t-6) © x^t-6) 
which is in view of the parallel circuit: 
=> x 2 (p) = 1 1 (p- 1 ) © I3(p-1 ) © I4(p-1 ) © I5(p-1 ) © x, (p-1 ) 

X 3 (t) = x 2 (t-1) 
<=>x 3 (t) = x 1 (t-2) 

<=> x 3 (t) = [l(t-3) © x 2 (t-3) © x 3 (t-3)j 

<=> x 3 (t) = I (t-3) © x^t-4) © x 2 (t-4) 

<=> x 3 (t) = I (t-3) © [l(t-5) © x 2 (t-5) © x 3 (t-5)] © x-,(t-5) 

<=> x 3 (t) = I (t-3) © l(t-5) © x^t-6) ©x 2 (t-6) © 

[l(t-6)©x 2 (t-6)©x 3 (t-6)] 
<=> x 3 (l) = 1(1-3) © 1(1-5) © 1(1-6) © 

[l(t-7) © x 2 (t-7) © x 3 (t-7)] © x 2 (t-7) 
<=> x 3 (t) = l(t-3) © l(t-5) © l(t-6) © l(t-7) ©x 2 (t-8) 
which is in view of the parallel circuit: 
=> X 3 (p) = I2(p-1 ) © I4(p-1 ) © I5(p-1 ) © I6(p-1 ) © X 2 (p-1 ) 

Q(t) = l(M)© Xl (t-1)©x 2 (t-1) 
<=> Q(t) = I (t-1 ) © [I (t-2) © x 2 (t-2) © x 3 (t-2)] © x^t-2) 
<=> Q(t) = |(t-1) © l(t-2) © x,(t-3) ©x 2 ((t-3) © 

[l(t-3)©x 2 (t-3)©x 3 (t-3)j 
<r:> Q(t) = l(t-1) © l(t-2) © l(t-3) © 

[l(t-4) © x 2 ((t-4) © x 3 (t-4)J © x 2 ((t-4) 
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<=> Q(t) = l(t-1 ) © l(t-2) © l(t-3) © l(t-4) © x 2 ((t-5) 
<=> Q(t) = l(M) © l(t-2) © l(t-3) © l(t-4) ©x^t-S) 
<=> Q(t) = l(t-1 ) © l(t-2) © 1(t-3) © l(t-4) © 

[l(t-7)©x 2 (t-7)©x 3 (t-7)] 
5 <=> Q(t) = l(t-1) © l(t-2) © l(t-3) © l(t-4) © l(t-7) © 

x 1 (t-8)©x 2 (t-8) 
which is in view of the parallel circuit: 
=> Q0(p) = I0(p-1) © I1(p-1) © I2(p-1) © I3(p-1) © 

I6(p-1)@x 1 (p-1)©x 2 (p-1) 
10 Q(t-1 ) = |(t-2) © Xt (t-2) © x 2 (t-2) 

<=> Q(M ) = l(t-2) © [l(t-3) © x 2 (t-3) © x 3 (t-3)] © 

Xl (t-3) 

<=> Q(t-1) = l(t-2) © l(t-3) © x^t-4) © x 2 ((t-4) © 
[l(t-4)©x 2 (t-4)©x 3 (t-4)J 
'5 <=> Q(M ) = l(t-2) © l(t-3) © l(t-4) © 

[l(t-5) © x 2 (t-5) © x 3 (t-5)j © x 2 (t-5) 
<=> Q(t-1) = l(t-2) © l(t-3) © l(t-4) © l(t-5) ©x 2 (t-6) 
<=> Q(t-1 ) = I (t-2) © l(t-3) © l(t-4) © l(t-5) © Xt (t-7) 
<=> Q(t-1 ) = l(t-2) © l(t-3) © l(t-4) © l(t-5) © 
20 [|(i-8) © x 2 (1-8) © x 3 (l-8)]] 

which is in view of the parallel circuit: 
=> Q1 (p) = 11 (p-1 ) © I2(p-1 ) © I3(p-1 ) © I4(p-1 ) © 
I7(p-1)©x 2 {p-1)©x 3 (p-1) 
Q(t-2) = l(t-3) © x^t-3) © x 2 (t-3) 
25 < = > Q(t-2) - l(t-3) © [l(t-4) © x 2 (t-4) © x3(t-4)] © 
Mt-4) 

<=> Q(t-2) = l(t-3) © l(t-4) © x,(t-5) © x 2 (t-5) © 
[l(t-5)©x 2 (t-5)©x 3 (t-5)] 

<=> Q(t-2) = l(t-3) © l(t-4) © l(t-5) © 
30 [|(t-6) © x 2 (t-6) © x 3 (t-6)] © x 2 (t-6) 

<=> Q(t-2) = l(t-3) © l(t-4) © l(t-5) © l(t-6) © x 2 (t-7) 

<=> Q(t-2) = l(t-3) © l(t-4) © l(t-5) © l(t-6) © Xt (t-8) 

which is in view of the parallel circuit: 

=> Q2(p) = I2(p-1 ) © I3(p-1 ) © 14(p-1 ) © I5(p-1 ) © Xt (p-1 ) 
35 Q(t-3) = l(t-4) © X, (t-4) © x 2 (t-4) 

<=> Q(t-3) = l(t-4) © [l(t-5) © x 2 (l-5) © x 3 (t-5)] © 
x^t-5) 

<=> Q(t-3) = l(t-4) © l(t-5) © x^t-6) © x 2 (t-6) © 

[l(t-6)©x 2 (t-6)©x 3 (t-6)] 
40 <=> Q(t-3) = l(t-4) © l(t-5) © l(t-6) © 

[l(t-7) © x 2 (t-7) © x 3 (t-7)] © x 2 (t-7) 
<=> Q(t-3) = l(t-4) © l(t-5) © l(t-6) © l(t-7) © x 2 (t-8) 
which is in view of the parallel circuit: 
=> Q3(p) =13(p-1) © I4(p-1) © I5(p-1) © I6(p-1) ©x 2 (p-1) 
45 Q(t-4) = l(t-5) © Xt (t-5) © x 2 (t-5) 

<=> Q(t-4) = I (t-5) © [l(t-6) © x 2 (t-6) © x 3 (t-6)] © 

Xl (t-6) 

<=> Q(l-4) = I (1-5) © 1(1-6) © x^l-7) © x 2 (l-7) © 
[l(t-7)©x 2 (t-7)©x 3 (t-7)] 
50 <=> Q(t-4) = l(t-5) © l(t-6) © l(t-7) © 

[I (t-8) © x 2 (t-8) © x 3 (t-8)] © x 2 (t-8) 
which is in view of the parallel circuit: 
=> Q4(p) = I4(p-1) © I5(p-1) © I6(p-1) © I7(p-1) © x 3 (p-1) 
Q(t-5) = l(t-6) © x, (t-6) © x 2 (t-6) 
55 <=> Q(t-5) = l(t-6) © [l(t-7) © x 2 (t-7) © x 3 (t-7)] © 
x,(t-7) 

<=> Q(t-5) = l(t-6) © l(t-7) © x^t-8) © x 2 (t-8) © 
[l(t-8)©x 2 (t-8) © x 3 (t-8)] 
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which is in view of the parallel circuit: 

=> Q5(p) =15(p-1)<w I6(p-1) © I7(p-1) © x^p-1) © x 3 (p-1) 

Q(t-6) = l(t-7) e x, (t-7) © x 2 (t-7) 
<=> Q(t-6) = I (t-7) © [l(t-8) © x 2 (t-8) © x 3 (t-8)] © 
s Xl (t-8) 

which is in view of the parallel circuit: 

=> Q6(p) = I6(p-1) © I7(p-1) e x^p-1) © x 2 (p-1 ) © x 3 (p-1) 

Q(t-7) = l(t-8) © x, (t-3) © x 2 (t-8) 
which is in view of the parallel circuit: 
10 => Q7(p) = I7(p-1 ) © x,(p-1 ) © x 2 (p-1 ) 

[0078] Fig. 7A and 7B show the result of this parallelization process., i.e. a circuit diagram for an eight bit parallel 
implementation of the turbo coding block shown in Fig. 4. 

4. VHDL Code 

15 

[0079] The recursive parallelization described above in a step-by-step manner may be used with logic synthesis 
tools to compare different design alternatives. Without restriction to any such particular system, here, reference is 
made to the Very High Speed Hardware Discription Language VHDL as being a typical example for the design of logic 
circuits using logic synthesis toois. 
20 [0080] An advantage of the VHDL Code approach to the implementalion of Ihe turbo coder block parallelization result 
is that a design may be realized in a very short time without being bound to a specific technology, e.g., ASIC or FPGA. 
[0081] Thus, the actual technology is not specified through the turbo coder block parallelization result but may be 
set as parameter to be handled by the logic synthesis tool subsequent to coding and thus be changed easily to compare 
different design options. 

25 [0082] In the following, VHDL codes arc listed for the 2, 4, and 8 bit parallelized turbo coder blocks shown in Fig. 5, 
6, and 7. Since the use of the VHDL code language is known in the art and the relation to the parallelization results 
explained above are self-evident no further explanation of details will be given here. 
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LIBRARY ieee; 

USE ieee . std_logic_1164 .ALL; 
USE ieee . std_logic_ari th . ALL; 



ENTITY en_turbo_coder_rtl IS 
PORT ( 

General: 
reset_p : 
cl)c32m ; 



input_8 
input_4 
input_2 
input 1 



IN STD_LOGIC; 

IN STD_LOGIC; Clock 

(rising edge triggered) 
IN std_logic_vector (7 DOWNTO 0) ; 
IN std_logic_vector (3 DOWNTO 0) ; 
IN std__logic_vector (1 DOWNTO C) ; 
IN std logic; 



turboCoding 2 bit parallel 

output_8 : OUT std_logic_vector (7 DOWNTO 0) 

output_4 : OUT std_logic_vector ( 3 DOWNTO 0) 

output_2 : OUT std_logic_vector ( 1 DOWNTO 0) 

output_l : OUT std__lo.gic 

) ; 

END en_turbo_coder_rtl ; 

ARCHITECTURE rtl OF en turbo coder rtl IS 



1095660A1_I.> 
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15 



20 



25 



30 



35 



SIGNAL 


Sl_xl 


: std_logic; 






^ THMAT. 


O X A. Z. 
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. oiu xoyxc/ 
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• oLU XOyXC/ 






O J- LJlN-ttXj 


ci ■; 

O X X 
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o lci xogxc, 
















SIGNAL 


s2_xl 


: std_logic; 










SL.a XOCJxC/ 






O X oINJtlXj 


S Z X J 


sua xOgxC/ 






O X vjlN-rt I j 


o Z X 


sua logic veClor (x 


DOWN 1 0 


0) ; 




o Z (J 


sua xogxc vecior \± 


UUWlN 1 u 


u) ; 


SIGNAL 


s4_xl 


: std__logic; 






C T PMft T 


S fi XZ 


: std logic; 






C T i^MTi T 


s 4 x 3 


: std logic; 






QTfKTAT 


S H X 


sia xogic vecLor ( j 


LJUWN I (J 




C:Tr , KT2VT 
o X LjiN/VXj 


s 4 o 


t std logic vector (3 


/"*\ T*T "K T T 

UOWNIO 


0) ; 


SIGNAL 


s8_xl 


: std_logic; 






SIGNAL 


s8__x2 


: std^logic; 






SIGNAL 


s8 x3 


: std_logic; 






SIGNAL 


s8__i 


: std_logic_vector (7 


DOWNTO 


0) ; 


SIGNAL 


s8_o 


; std_logic_vector (7 


DOWNTO 


0) ; 



40 



BEGIN 



45 



tc_l : PROCESS (clk32m, reset_p) — seriell building of 

turbo coder block TCB 



so 



BEGIN 



IF reset p = f l' THEN 



55 
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10 



15 



20 



25 



sl_xl 


<= 


•0' 


sl_x2 


<= 


' 0' 


sl_x3 


<= 


' 0' 


sl_i 


<= 


•0' 


si o 


<= 


'0' 



ELSIF c 1 k3 2m ? EVENT AND clk32m = ? l f THEN 



sl_i 


<= 


input_l ; 










sl_xl 


<= 


sl_i XOR 




_x2 


XOR 


sl_x3; 


sl_x2 


<= 


sl_xl; 










sl__x3 


<= 


sl_x2; 










Sl_0 


<= 


sl_i XOR 


sl_ 


x2 


XOR 


sl_xl; 



END IF; 



30 



35 



END PROCESS tc_l; 
output_l <= sl_o; 
tc 2: PROCESS (clk32m, reset p) 



— 2bit parallel building 
of turbo coder block 



40 



BEGIN 



45 



50 



55 



IF reset_p = ' 1' THEN 

s2_xl <= ' 0' ; 

s2_x2 <= ' 0 f ; 

s2_x3 <= 1 0' ; 

s2_i <= (OTHERS => ' 0'); 

s2 o <= (OTHERS => 1 0' ) ; 
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ELSIF clk32m 1 EVENT AND clk32m = f l ! THEN 
s2_i <= input__2; 

s2_xl <= s2_i(0) XOR s2_xl XOR s2_x2; 
s2_x2 <= s2_i(l) XOR s2_x2 XOR s2_x3; 
s2_x3 <= s2_xl; 
s2_o (0) <= s2_i (0) XOR 

s2_i(l) XOR s2_xl XOR s2_x2 XOR s2_x3; 
s2_o(l) <= s2_i(l) XOR s2_xl XOR s2_x2; 

END IF; 

END PROCESS tc_2; 
output 2 <= s2 o; 



30 tc_4: PROCESS (clk32m, reset__p) — 4bit parallel building 

of turbo coder block 

35 BEGIN 

IF reset_p = ' 1' THEN 

40 



55 



s4 xl 


<= 


' 0 f ; 


s4_x2 


<= 


1 0' ; 


s4_x3 


<= 


1 0' / 


s4_i 


<= 


(OTHERS => f CP ) ; 


s4 o 


<= 


(OTHERS => f 0* ) ; 



ELSIF clk32m' EVENT AND clk32m = '1' THEN 
s4_i <= input_4; 

s4 xl <= s4 i(0) XOR s4 i(2) XOR s4 i(3! 
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XOR 


s4 


xl XOR s4 x3; 


s4 


x2 


<= 


s4 


i (1) 


XOR 


s4 


i (3) XOR s4 xl 












XOR 


s4 


_x2 XOR s4_x3; 


s4 


x3 


<= 


s4 


i (2) 


XOR 


s4 


_xl XOR s4 x2; 


s4 


o (0) 


<= 


s4 


i (0) 


XOR 


s4 


i (1) XOR s4 i (2) 












XOR 


s4 


i(3) XOR s4 x3; 


s4 


_o(l) 


<= 


s4 


_i (1) 


XOR 


s4 . 


_i (2) XOR s4_i (3) 












XOR 




_xl XOR s4_x3; 


S4 


_o(2) 


<= 


s4 


_i(2) 


XOR 


s4 


_i (3) XOR s4_xl 












XOR 


s4 


_x2 XOR s4_x3; 


s4 


_o(3) 


<= 


s4 


_i(3) 


XOR' 


s4 


_xl XOR s4_x2; 



END IF; 
END PROCESS tc_4 ; 
output_4 <= s4_o; 

30 

tcJB: PROCESS (clk32m, resetjp) — 8bit parallel building 

35 

of turbo coder block 

BEGIN 

40 

IF reset_p = '1 T THEN 

<= ■ 0 ' ; 

<= ' 0'; 

<= 1 0 1 ; 

<= (OTHERS = > 1 0 1 ) ; 

<= (OTHERS => ' 0' ) ; 

ELSIF clk32m l EVENT AND clk32n\ = 'T THEN 



s8_xl 
s8_x2 
s8_x3 
s8_i 
s8 o 
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s8_i <= input_8; 



So 


X X 




S O 




AUK 


s o 


_i (2) 


XOR s8_i (3) 












AUK 


S o 


_i (4) 


XOR s8_i (7) 












\r /-XT") 

X OR 


So 


x2 XOR s8_x3; 


So 


xz 


< = 


S o 


_1 ( 1 ) 


XOR 


So 


_i (3) 


XOR s8_i (4) 












XOR 


o 

So 


_i (5) 


XOR s8_xl; 


S O 


X J 


< = 


S 0 


_l (2 ) 


XOR 


S o 


_i (4) 


XOR s8_i (5) 












XOR 


s8 


_i (6) 


XOR S8_x2; 


S o 


_o ( 0) 


< = 


So 


_l (0) 


XOR 


S o 


_i (1) 


XOR s8_i (2) 












XOR 


So 


_i (3) 


XOR s8_i (6) 












XOR 


s 8 


_xl XOR s8_x2; 


S o 


_0 ( 1 ) 


< = 


_ o 

S 0_ 


_1 (1) 


XOR 


s 8 


_i (2) 


XOR s8_i (3) 












XOR 


So 


_i (4) 


XOR s8_i (7) 












XOR 


S O 


_x2 XOR s8_x3; 




_o (2) 


< = 


s8_ 


_i (2) 


XOR 


s8 


_i (3) 


XOR s8_i (4) 












XOR 


s8 


_i (5) 


XOR s8_xl; 


/•> 

so 


_o (3) 


< = 




_i (3) 


XOR 


s8^ 


_i (4) 


XOR s 8_i ( 5 ) 












XOR 


s8 


_i (6) 


XOR s8_x2; 


s8 


_o (4) 


< = 


s8_ 


_i (4) 


XOR 


s8_ 


_i (5) 


XOR s8_i (6) 












XOR 


s8_ 


_i(7) 


XOR s8_x3; 


5 8 


_o(5) 


<= 


s8_ 


_i (5) 


XOR 


s8_ 


_i (6) 


XOR s8_i (7) 












XOR 


s8 


_xl XOR s8_x3; 


s8_ 


_o(6) 


<= 


s8_ 


_i (6) 


XOR 


s8 . 


_i (7) 


XOR s8_xl 












XOR 


s8_ 


_x2 XOR s8_x3; 


S8 


_o(7) 


<= 


s8_ 


_i(7) 


XOR 


s8_ 


_xl XOR s8_x2; 



END IF; 

END PROCESS tc_8; 

50 

output_8 <= s8_o; 

55 

END rtl; 
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5. Turbo Coder Realization 

[0083] While above reference has been made to Fig. 8 with respect to background art of the present invention this 
Fig. 8 is also related to the present invention as will be shown in the following. 
s [0084] According to the present invention described so far reference has been made to the parallelization of turbo 
coder blocks with M outputs. Nevertheless, it should be mentioned that according to the present invention the con- 
struction of a complete turbo coder requires the instantiation of the parallelized turbo coder block twice, as shown in 
Fig. 8 (where M = 1 is assumed in both blocks). 

[0085] One turbo coder block is related to the normal input data stream to derive OUTPUT 1 shown in Fig. 8 and 
70 the other is related to the interleaved input data stream to derive OUTPUT 2. Since the input data stream is also directly 
forwarded to OUTPUT 0 the three outputs of the turbo coder have a width of 3n bits per parallel clock cycle, where n 
is the degree of parallelization. 

75 Claims 

1. Parallel turbo coder implementation method starting from a serial turbo coder block description according to 

20 X!<t) = I(t-l) © ai-xift-l) © a 2 x 2 (t-l) © ... © a N -x N (t-l) 

ai e {0, 1} 

x 2 (t) = X! (t-1) 



30 x N (t) = x N _! (t-1) 

Qj (t) = PjO'I (t-l) © 

xi(t-l) tP j0 -ai © Pji] © 

35 



40 



45 



x N (t-l)-[P j0 -a N © p jN ] 

Pij e {0, 1} 

j 8 [1, . . . , N] 

comprising the steps: 

a) carrying out a time index substitution for the first internal state according to: 



so 



55 
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x^t-l) = I(t-2) © a 1 -x 1 (t-2) © 

a 2 -x 2 (t-2) © ... © 
a N -x N (t-2) 



(2.X!.!) 



xi(t-(n-l)) = I(t-n) © ai-xi (t-n) 
is a 2 -x 2 (t-n) 

a N x N (t-n) 



(2 .xi .n-1) 



where n Is the degree of parallelization; 

b) carrying out a time index substitution for the remaining internal states (U2, ...,N) according to: 
Xi(t-l) = x i _ 1 (t-2) 

(2.Xi.l) 



x ± (t- (n-1) ) - xi-i (t-n) 

(2 .Xi .n-1) 

c) carrying out a time index substitution for the output signal according to 

Qj (t-i) = PjQ-I (t-(i+l) ) © 

Xitt-ti+l) J-iPjo-oti © Pjil © 



x N (t-(i+l) )-[pjo-aN © PjN^ 

i e [1, . . . , n-1] 
(2.Q.i) 



55 to derive a parallel output vector: 
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Qj (t) 
Qj (t-1) 



= QjO(p) 
= QjKp) 



10 

Qj (t- (n-1) ) = Qjn-1 (p) 

j 6 [1, . . . , M] 

d) carrying out an internal state substitution process for each internal state x k (t) (k=1 , .... N) according to the 
following sub-steps: 

d1) set maximum time index element for the internal state x k (t) to t max = t-1; 

d2) scan the representation of the internal state x k (t) for internal states with maximum time index t max ; 

d3) execute backward time index transition from t max to t max -1 in the representation of the internal state 
x k (t) through state variable substitution steps using eqs. (2); and 

d4) decrement t max by a value of 1 and repeat steps d2) to d4) while t max > t-n; 

e) carrying out an internal state substitution process for each element Qj(t-i) (i- 0, n-2) of each parallel 
output vector Qj(t) , .... M) according to the following sub-steps: 

e1 ) set maximum time index element for vector element Qj(t-i) in the considered parallel output vector Qj 
(t) to t max =t-i-1; 

e2) scan the representation of the vector element Qj(t-i) for internal states with maximum time index; 

e3) execute backward time index transition in the representation of the vector element Qj(t-i) from t niax to 
t max -1 through state variable substitution steps using eqs. (2): and 

e4) decrement t max by a value of 1 and repeat steps e2) to e4) while t max > t-n. 

2. Parallel turbo coder block of parallelization degree n, comprising 

a) means (I0 : .... 17) to store n samples (l(t-1). .... l(t-n))of an input signal l(t) to the parallel turbo coding block; 

is b) at least one means (Q0, ... s Q7) to store n samples (Qj(t), .... Qj(t-(n-l)) of at least one output signal Qj(t) 

(j=1, .... M) of the parallel turbo coding block; and 

c) lurbo coder means comprising a bank of n delay unils(X 1 , .... X N ) and being adapted to a parallel processing 
of n samples of the input signal l(t) such that at least two delay units (X 1( .... X N ) of the bank directly receive 
so subsets of the n samples (l(M ), .... I(t-n)) of the input signal l(t) and an output signal of at least one delay unit 

(X^ .... X N ) in the turbo coder means is supplied to at least two delay units in the turbo coder means. 

3. Parallel turbo coder block according to claim 2, characterized in that the turbo coder means has a structure being 
derivable from a serial turbo coder block description according to 

ss 



15 



20 



25 



30 



35 



40 
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TO 



20 



25 



Xl (t) = I(t-l) © ajxilt-l) © a 2 -x 2 (t-l) © ... © a N x N (t-l) 

oii e {0, 1} 

x 2 (t) = X! (t-1) 



x N (t) = xn-i ( t-1 ) 
Qj (t) = pjo-I (t-1) © 
« Xl (t-l) [pj 0 oci © Pjl] © 



x N (t-l)-[p j0 cx N © p jN ] 

Pji E {0, 1} 

j e [1, . . M] 



through the parallel turbo coder implementation method of claim 1 . 



Parallel turbo coder block according to claim 2 or 3, characterized in that the parallelization degree is 2. that N = 
3. M = 1 . a = [0. 1 . 1 ]. that (3 = [1 , 1,0, 1 ], and that the structure of the turbo coder means is 

Q0(p) = I0(p-1 ) e 1 1 (p-1 ) e xt (p-1 ) © x 2 (p-i ) © x 3 (p-i ) 

Q1 (p) = 1 1 (p-1 ) © (p-1 ) © x 2 (p-1 ), 
wherein 

x 1 (p) = IO(p-1)©x 1 (p-1)©x 2 (p-1) i 
x 2 (p) = 1 1 (p-1 ) © x 2 (p-1 ) © x 3 (p-1 ), and 
x 3 (P) = x,(p-1). 

40 5. Parallel turbo coder block according to claim 2 or 3 : characterized in that the parallelization degree is 4, that N = 
3, M = 1, a = [0, 1 , 1], that |3 = [1 , 1,0, 1 ], and that the structure of the turbo coder means is 



Q0(p) = I0(p-1) © 11 (p-1 ) © I2(p-1) © I3(p-1) © x 3 (p-1 ) 
Q1 (p) = 11 (p-1 ) © I2(p-1) © I3(p-1) © X^p-1 ) © x 3 (p-1 ) 
Q2(p) = I2(p-1 ) © I3(p-1 ) © X! (p-1 ) © x 2 (p-1 ) © x 3 (p-1 ) 
Q3(p) = I3(p-1) ©x^p-1) ©x 2 (p-1), 



wherein 

x^p) = I0(p-1) © I2(p-1) © I3(p-1) © 
so x 1 (P-1)©x 3 (p-1), 

x 2 (p) = 11 (p-1) ©I3(p-1) © 

x^p-1) ©x 2 (p-1) © x 3 (p-1), and 
x 3 (p) = l2(p-1)©x l (P-1)©x 2 (p-1). 

55 6. Parallel turbo coder block according to claim 2 or 3 : characterized in that the parallelization degree is 8, that N = 
3, M = 1 , a - [0, 1 , 1 ], that (5 = [1 , 1,0, 1], and that the structure of the turbo coder means is 
Q0(p) = I0(p-1) © 11 (p-1) © I2(p-1) © I3(p-1) © 
!6(p-1)©x 1 (P-1)©x 2 (p-1) 
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Q1 (p) = 11 (p-1 ) © I2(p-1 ) © I3(p-1 ) © I4(p-1 ) © 

I7(p-1)©x 2 (p-1)©x 3 (p-1) 
Q2(p) = I2(p-1 ) © I3(p-1 ) © I4(p-1 ) © I5(p-1 ) © Xl (p-1 ) 
Q3(p) =1 3(p-1 ) © I4(p-1 ) © I5(p-1 ) © I6(p-1 ) © x 2 (p-1 ) 
5 Q4(p) = I4(p-1 ) © I5(p-1 ) © I6(p-1 ) © I7(p-1 ) © x 3 (p-1 ) 

Q5(p) = I5(p-1) © I6(p-1) © 17(p-1) ©x^P-1) ©x 3 (p-1) 
Q6(p) = I6(p-1 ) © I7(p-1 ) © Xl (p-1 ) © x 2 (p-1 ) © x 3 (p-1 ) 
Q7(p) = l7(p-1)©X 1 (P-1)©x 2 (p-1) ) 
wherein 

10 Xl (p) = I0(p-1) © I2(p-1) © I3(p-1) © 

I4(p-1)©l7(p-1)©x 2 (p-1)©x 3 (p-1) l 
x 2 (p) = H (p-1) © I3(p-1 ) © I4(p-1 ) © I5(p-1 ) © 
(p-1), and 

x 3 (p) = I2(p-1 ) I4(p-1 ) © I5(p-1 ) © I6(p-1 ) © x 2 (p-1 ). 

75 

7. Computer programm product directly loadable into an internal memory ot a computer 

comprising software code portions for performing the steps according to claim 1 
when the product is run on a computer. 

20 

8. Computer programm product according to claim 7, characterized in that the software code portions are of the 
VHDL type. 

9. Computer programm product according to claim 8, characterized in that the software code portions are defined to: 

2S 



30 



35 



40 



45 



SO 



55 



JSDOCID: <EP 1085660AV \_ > 



29 



EP 1 085 660 A1 



LIBRARY ieee; 

USE ieee . std__logic_l 164 .ALL; 
USE ieee. std logic_arith . ALL; 



10 



15 



20 



25 



30 



35 



ENTITY en_turbo_coder_rtl IS 
PORT ( 

General: 
reset_jp : 
clk32m : 



input_8 
input_4 
input_2 
input_l 



IN STD_LOGIC; 

IN STDJLOGIC; — Clock 

(rising edge triggered) 
IN -std_logic_vector (7 D0WNT0 0} 
IN std__logic_vector (3 DOWNTO 0) 
IN std_logic_vector {1 DOWNTO 0) 
IN std logic; 



— turboCoding 2 bit parallel 



output_8 
output_4 
output__2 
output_l 

) ; 

END en turbo coder rtl; 



OUT std_logic_vector (7 DOWNTO 0) ; 

OUT std_logic_vector (3 DOWNTO 0) ; 

OUT std_logic_vector (1 DOWNTO 0) ; 
OUT std logic 



40 



ARCHITECTURE rtl OF en turbo coder rtl IS 



45 



SO 



SIGNAL sl_xl 
SIGNAL sl_x2 
SIGNAL sl_x3 
SIGNAL sl_i 
SIGNAL si o 



std_logic; 
std_logic; 
std_logic; 
std_logic; 
std_logic; 



55 
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SIGNAL 


s2 


_xl 


SIGNAL 


s2 


_x2 


SIGNAL 


s2 


_x3 


SIGNAL 


s2 


_i 


SIGNAL 


S2 


o 
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std_logic; 
std_logic; 
std_logic; 

std_logic_vector (1 DOWNTO 0) ; 
std_logic_vector (1 DOWNTO 0) ; 



10 





SIGNAL 


s4_xl 


: std_logic; 








SIGNAL 


s4_x2 


: std_logic; 






15 


SIGNAL 


s4_x3 


: std_logic; 








SIGNAL 


s4 i 


: std_logic_vector ( 3 


DOWNTO 


0) ; 


20 


SIGNAL 


s4_o 


: std logic_vector ( 3 


DOWNTO 


0) ; 


SIGNAL 


s8_xl 


: std_logic; 








SIGNAL 


s8_x2 


: std_logic; 






25 


SIGNAL 


s8_x3 


: std_logic; 








SIGNAL 


s8 i 


: std_logic_vector (7 


DOWNTO 


0) ; 




SIGNAL 


s8_o 


: std_logic_vector ( 7 


DOWNTO 


0) ; 



30 



BEGIN 



35 


tc 1: PROCESS (clk32m, reset 




BEGIN 




40 


IF reset_p = 


' 1 1 THEN 




si xl <= 


' 0 f ; 


45 


sl_x2 <= 


■0' ; 




sl_x3 <= 


' 0' ; 




sl_i <= 


' 0 f ; 


SO 


si o <= 


•0' ; 



ELSIF clk3 2m 1 EVENT AND clk32m 



THEN 



55 



4SOOCID: <EP 1095660A1 J_> 



31 



EP 1 085 660 A1 



sl_i <= 

sl_xl <= 

sl_x2 <= 

sl_x3 <= 

si o <= 



input_l ; 

sl_i XOR sl_x2 XOR sl_x3; 

sl_xl; 

sl_x2; 

si i XOR si x2 XOR si xl; 



10 



END IF; 



75 

END PROCESS tc_l ; 
20 output_l <= sl_o; 



tc_2: PROCESS (clk32m, reset_p) — 2bit par building of TC 

25 

BEGIN 



30 IF reset_p = ' l 1 THEN 



s2_xl <= ' 0 1 ; 

s2_x2 <= '0'; 

s2_x3 <= ' 0» ; 

s2_i <= (OTHERS => f 0'); 

s2 o <= (OTHERS => 1 0 T ) ; 



35 



40 



ELSIF clk32m f EVENT MD clk32m = ' 1' THEN 
s2_i <- input__2; 

s2_xl <= s2_i(0) XOR s2__xl XOR s2_x2; 
s2_x2 <= s2_i(l) XOR s2_x2 XOR s2_x3; 
s2_x3 <= s2_xl; 
s2_o (0) <= s2_i (0) XOR 

s2 i(l) XOR s2 xl XOR s2 x2 XOR s2 x3; 
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s2_o(l) <= s2_i(l) XOR s2_xl XOR s2_x2; 

END IF; 

END PROCESS tc_2; 
output_2 <= s2_o; 

tc_4 : PROCESS (clk32m, reset_p) — 4bit par building of 
BEGIN 

IF reset_p = ' 1 * THEN 

s4_xl . <= ' 0 ' ; 

s4_x2 <= '0'; 

S4_x3 <= '0'; 

s4_i <= (OTHERS => '0'); 

S 4_o <= (OTHERS = > '0'); 

ELS IF clk3 2m 'EVENT AND clk32m = '1' THEN 



s4 


i 


<= 


input_4 


t 






S 4_ 


_xl 


<= 


s4_i (0) 


XOR 


s4_ 


_i (2) XOR s4_i (3) 










XOR 


S4_ 


_xl XOR s4_x3; 


s4_ 


_x2 


<= 


s4_i (1) 


XOR 


s4 


_i (3) XOR s4_xl 










XOR 


s4 


_x2 XOR s4_x3; 


s4_ 


_x3 


<= 


s4_i (2) 


XOR 


s4_ 


_xl XOR s4_x2; 


s4_ 


_o(0) 


<= 


s4_i (0) 


XOR 


S 4 


_i (1) XOR s4_i (2) 










XOR 


S4 


_i (3) XOR s4_x3; 


s4 


_o (1) 


<= 


s4_i (1) 


XOR 


s4 


_i (2) XOR s4_i (3) 










XOR 


s4 


xl XOR S4_x3; 


s4 


_o (2) 


<= 


s4_i (2) 


XOR 


s4 


_i (3) XOR s4_xl 
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XOR s4_x2 XOR s4_x3; 

5 s4_o(3) <= s4_i(3) XOR s4_xl XOR s4_x2; 

END IF; 

10 

END PROCESS tc_4 ; 
is output__4 <= s4_o; 

20 tc_8 : PROCESS (clk32m, reset_p) — 8bit par building of TC 

BEGIN 

25 

IF reset_p = ' l f THEN 

<= r 0 ' ; 
<= ' 0 1 ; 
f 0 ' ; 

<= (OTHERS => ' 0 1 ) ; 
<= (OTHERS => ! 0' ) ; 

ELSIF clk32m ? EVENT AND clk32m = THEN 





s8 


_i 


<= 


input_8 


t 






45 


s8 


_xl 


<= 


s8_i (0) 


XOR 


s8_i(2) XOR 


s8_i (3) 












XOR 


S8_i(4) XOR 


s8_i (7) 












XOR 


s8_x2 XOR s8 


x3 ; 


SO . 


s8 


_x2 


<= 


s8_i (1) 


XOR 


s8_i(3) XOR 


s8_i (4) 












XOR 


s8_i(5) XOR 


s8_xl ; 




s8 


_x3 


<= 


s8_i (2) 


XOR 


s8_i(4) XOR 


s8_i (5) 


55 










XOR 


s8_i(6) XOR 


s8_x2 ; 




s8 


_o{0) 


<= 


s8_i (0) 


XOR 


s8_i(l) XOR 


s8_i (2) 
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XOR s8_i(3) XOR s8_i(6) 
XOR s8_xl XOR s8_x2; 
s8_o(l) <= s8_i(l) XOR s8_i(2) XOR s8_i(3) 

XOR s8_i(4) XOR s8_i(7) 
XOR s8_x2 XOR s8_x3; 
s8_o(2) <= s8_i(2) XOR s8_i(3) XOR s8_i(4) 

XOR s8_i(5) XOR s8_xl; 
s8_o(3) <= s8_i(3) XOR s8_i(4) XOR s8_i(5) 

XOR s8_i(6) XOR s8_x2; 
s8_o(4) <= s8_i(4) XOR s8_i(5) XOR s8_i(6) 

XOR s8_i(7) XOR s8_x3; 
s8_o(5) <= s8_i(5) XOR s8_i(6) XOR s8_i(7) 

XOR s8_xl XOR s8__x3; 
s8_o(6) <= s8_i(6) XOR s8_i(7) XOR s8_xl 

XOR s8_x2 XOR s8_x3; 
s8_o(7) <= s8_i(7) XOR s8_xl XOR s8__x2; 

END IF; 
END PROCESS tc_8; 

35 

output_8 <- s8_o; 
40 END rtl; 

45 
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FIG. 7 A 

8 BIT PARALLEL TURBO BLOCK- INPUT PART 
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FIG. 7B 

8 BIT PARALLEL TURBO CODING BLOCK- PART FOR OUTPUTS Q 
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